This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING

1
2
3 (1) General Information:
4
--> 5 (i) APPLICANT:
--> 6 (A) NAME: Hodgson, John
--> 7 (B) STREET: Great West Road
--> 8 (C) CITY: Brentford
9 (D) STATE OR PROVINCE: Middlesex
--> 10 (E) COUNTRY: United Kingdom
--> 11 (F) POSTAL CODE: TW8 9EP
12
--> 14 (A) NAME: Hodgson, John
--> 15 (B) STREET: Great West Road
--> 16 (C) CITY: Brentford
17 (D) STATE OR PROVINCE: Middlesex
--> 18 (E) COUNTRY: United Kingdom
--> 19 (F) POSTAL CODE: TW8 9EP
20

21 (ii) TITLE OF INVENTION: Novel tRNA Synthetase 

22 
23 

24 (iii) NUMBER OF SEQUENCES: 2 
25 

26 (v) COMPUTER READABLE FORM:

27 (A) MEDIUM TYPE: Diskette 

28 (B) COMPUTER: IBM Compatible 

29 (C) OPERATING SYSTEM: DOS 

30 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
31 

32 (vi) CURRENT APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 09/302,024 

34 (B) FILING DATE: 
35 

36 (vii) PRIOR APPLICATION DATA: 

37 (A) APPLICATION NUMBER: US/08/785,455 

38 (B) FILING DATE: 
39 

40 (2) INFORMATION FOR SEQ ID NO:1:
41 

42 (i) SEQUENCE CHARACTERISTICS: 

43 (A) LENGTH: 1974 base pairs 

44 (B) TYPE: nucleic acid 

45 (C) STRANDEDNESS: double 

46 (D) TOPOLOGY: linear

RAW SEQUENCE LISTING
PATENT APPLICATION US/09/302,024
DATE: 04/05/2000
TIME: 17:05:34
INPUT SET: S35223.raw

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



ATGGCTAAAG AAACATTTTA TATAACAACC CCAATATACT ATCCTAGTGG GAATTTACAT 60
ATAGGACATG CATATTCTAC AGTGGCTGGA GATGTTATTG CAAGATATAA GAGAATGCAA 120
GGATATGATG TTCGTTATTT GACTGGAACG GATGAACACG GTCAAAAAAT TCAAGAAAAA 180
GCTCAAAAAG CTGGTAAGAC AGAAATTGAA TATTTGGATG AGATGATTGC TGGAATTAAA 240
CAATTGTGGG CTAAGCTTGA AATTTCAAAT GATGATTTTA TCAGAACAAC TGAAGAACGT 300
CATAAACATG TCGTTGAGCA AGTGTTTGAA CGTTTATTAA AGCAAGGTGA TATCTATTTA 360
GGTGAATATG AAGGTTGGTA TTCTGTTCCG GATGAAACAT ACTATACAGA GTCACAATTA 420
GTAGACCCAC AATACGAAAA CGGTAAAATT ATTGGTGGCA AAAGTCCAGA TTCTGGACAC 480
GAAGTTGAAC TAGTTAAAGA AGAAAGTTAT TTCTTTAATA TTAGTAAATA TACAGACCGT 540
TTATTAGAGT TCTATGACCA AAATCCAGAT TTTATACAAC CACCATCAAG AAAAAATGAA 600
ATGATTAACA ACTTCATTAA ACCAGGACTT GCTGATTTAG CTGTTTCTCG TACATCATTT 660
AACTGGGGTG TCCATGTTCC GTCTAATCCA AAACATGTTG TTTATGTTTG GATTGATGCG 720
TTAGTTAACT ATATTTCAGC ATTAGGCTAT TTATCAGATG ATGAGTCACT ATTTAACAAA 780
TACTGGCCAG CAGATATTCA TTTAATGGCT AAGGAAATTG TGCGATTCCA CTCAATTATT 840
TGGCCTATTT TATTGATGGC ATTAGACTTA CCGTTACCTA AAAAAGTCTT TGCACATGGT 900
TGGATTTTGA TGAAAGATGG AAAAATGAGT AAATCTAAAG GTAATGTTGT AGACCCTAAT 960
ATTTTAATTG ATCGCTATGG TTTAGATGCT ACACGTTATT ATCTAATGCG TGAATTACCA 1020
TTTGGTTCAG ATGGCGTATT TACACCTGAA GCATTTGTTG AGCGTACAAA TTTCGATCTA 1080
GCAAATGACT TAGGTAACTT AGTAAACCGT ACGATTTCTA TGGTTAATAA GTACTTTGAT 1140
GGCGAATTAC CAGCGTATCA AGGTCCACTT CATGAATTAG ATGAAGAAAT GGAAGCTATG 1200
GCTTTAGAAA CAGTGAAAAG CTACACTGAA AGCATGGAAA GTTTGCAATT TTCTGTGGCA 1260
TTATCTACGG TATGGAAGTT TATAAGTAGA ACGAATAAGT ATATTGACGA AACAACGCCT 1320
TGGGTATTAG CTAAGGACGA TAGCCAAAAA GATATGTTAG GCAATGTAAT GGCTCACTTA 1380
GTTGAAAATA TTCGTTATGC AGCTGTATTA TTACGTCCAT TCTTAACACA TGCGCCGAAA 1440
GAGATTTTTG AACAATTGAA CATAAACAAT CCTCAATTTA TGGAATTTAG TAGTTTAGAG 1500
CAATATGGTG TGCTTACTGA GTCAATTATG GTTACTGGGC AACCTAAACC TATTTTCCCA 1560
AGATTGGATA GCGAAGCGGA AATTGCATAT ATCAAAGAAT CAATGCAACC GCCTGCTACT 1620
GAAGAGGAAA AAGAAGAGAT TCCTAGCAAA CCTCAAATTG ATATTAAAGA CTTTGATAAA 1680
GTTGAAATTA AGGCAGCAAC GATTATTGAT GCTGAACATG TTAAGAAGTC AGATAAGCTT 1740
TTAAAAATTC AAGTAGACTT AGATTCTGAA CAAAGACAAA TTGTATCAGG AATTGCCAAA 1800
TTCTATACAC CAGATGATAT TATTGGTAAA AAAGTAGCAG TTGTTACTAA CCTGAAACCA 1860
GCTAAATTAA TGGGACAAAA ATCTGAAGGT ATGATATTAT CTGCTGAAAA AGATGGTGTA 1920
TTAACCTTAG TAAGTTTACC AAGTGCAATT CCAAATGGTG CAGTGATTAA ATAA 1974



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Lys Glu Thr Phe Tyr Ile Thr Thr Pro Ile Tyr Tyr Pro Ser
1 5 10 15

100 Gly Asn Leu His Ile Gly His Ala Tyr Ser Thr Val Ala Gly Asp Val
101 20 25 30
102 Ile Ala Arg Tyr Lys Arg Met Gln Gly Tyr Asp Val Arg Tyr Leu Thr
103 35 40 45
104 Gly Thr Asp Glu His Gly Gln Lys Ile Gln Glu Lys Ala Gln Lys Ala
105 50 55 60
106 Gly Lys Thr Glu Ile Glu Tyr Leu Asp Glu Met Ile Ala Gly Ile Lys
107 65 70 75 80
108 Gln Leu Trp Ala Lys Leu Glu Ile Ser Asn Asp Asp Phe Ile Arg Thr
109 85 90 95
110 Thr Glu Glu Arg His Lys His Val Val Glu Gln Val Phe Glu Arg Leu
111 100 105 110
112 Leu Lys Gln Gly Asp Ile Tyr Leu Gly Glu Tyr Glu Gly Trp Tyr Ser
113 115 120 125
114 Val Pro Asp Glu Thr Tyr Tyr Thr Glu Ser Gln Leu Val Asp Pro Gln
115 130 135 140
116 Tyr Glu Asn Gly Lys Ile Ile Gly Gly Lys Ser Pro Asp Ser Gly His
117 145 150 155 160
118 Glu Val Glu Leu Val Lys Glu Glu Ser Tyr Phe Phe Asn Ile Ser Lys
119 165 170 175
120 Tyr Thr Asp Arg Leu Leu Glu Phe Tyr Asp Gln Asn Pro Asp Phe Ile
121 180 185 190
122 Gln Pro Pro Ser Arg Lys Asn Glu Met Ile Asn Asn Phe Ile Lys Pro
123 195 200 205
124 Gly Leu Ala Asp Leu Ala Val Ser Arg Thr Ser Phe Asn Trp Gly Val
125 210 215 220
126 His Val Pro Ser Asn Pro Lys His Val Val Tyr Val Trp Ile Asp Ala
127 225 230 235 240
128 Leu Val Asn Tyr Ile Ser Ala Leu Gly Tyr Leu Ser Asp Asp Glu Ser
129 245 250 255
130 Leu Phe Asn Lys Tyr Trp Pro Ala Asp Ile His Leu Met Ala Lys Glu
131 260 265 270
132 Ile Val Arg Phe His Ser Ile Ile Trp Pro Ile Leu Leu Met Ala Leu
133 275 280 285
134 Asp Leu Pro Leu Pro Lys Lys Val Phe Ala His Gly Trp Ile Leu Met
135 290 295 300
136 Lys Asp Gly Lys Met Ser Lys Ser Lys Gly Asn Val Val Asp Pro Asn
137 305 310 315 320
138 Ile Leu Ile Asp Arg Tyr Gly Leu Asp Ala Thr Arg Tyr Tyr Leu Met
139 325 330 335
140 Arg Glu Leu Pro Phe Gly Ser Asp Gly Val Phe Thr Pro Glu Ala Phe
141 340 345 350
142 Val Glu Arg Thr Asn Phe Asp Leu Ala Asn Asp Leu Gly Asn Leu Val
143 355 360 365
144 Asn Arg Thr Ile Ser Met Val Asn Lys Tyr Phe Asp Gly Glu Leu Pro
145 370 375 380
146 Ala Tyr Gln Gly Pro Leu His Glu Leu Asp Glu Glu Met Glu Ala Met
147 385 390 395 400
148 Ala Leu Glu Thr Val Lys Ser Tyr Thr Glu Ser Met Glu Ser Leu Gln
149 405 410 415
150 Phe Ser Val Ala Leu Ser Thr Val Trp Lys Phe Ile Ser Arg Thr Asn
151 420 425 430
152 Lys Tyr Ile Asp Glu Thr Thr Pro Trp Val Leu Ala Lys Asp Asp Ser



153 435 440 445
154 Gln Lys Asp Met Leu Gly Asn Val Met Ala His Leu Val Glu Asn Ile
155 450 455 460
156 Arg Tyr Ala Ala Val Leu Leu Arg Pro Phe Leu Thr His Ala Pro Lys
157 465 470 475 480
158 Glu Ile Phe Glu Gln Leu Asn Ile Asn Asn Pro Gln Phe Met Glu Phe
159 485 490 495
160 Ser Ser Leu Glu Gln Tyr Gly Val Leu Thr Glu Ser Ile Met Val Thr
161 500 505 510
162 Gly Gln Pro Lys Pro Ile Phe Pro Arg Leu Asp Ser Glu Ala Glu Ile
163 515 520 525
164 Ala Tyr Ile Lys Glu Ser Met Gln Pro Pro Ala Thr Glu Glu Glu Lys
165 530 535 540
166 Glu Glu Ile Pro Ser Lys Pro Gln Ile Asp Ile Lys Asp Phe Asp Lys
167 545 550 555 560
168 Val Glu Ile Lys Ala Ala Thr Ile Ile Asp Ala Glu His Val Lys Lys
169 565 570 575
170 Ser Asp Lys Leu Leu Lys Ile Gln Val Asp Leu Asp Ser Glu Gln Arg
171 580 585 590
172 Gln Ile Val Ser Gly Ile Ala Lys Phe Tyr Thr Pro Asp Asp Ile Ile
173 595 600 605
174 Gly Lys Lys Val Ala Val Val Thr Asn Leu Lys Pro Ala Lys Leu Met
175 610 615 620
176 Gly Gln Lys Ser Glu Gly Met Ile Leu Ser Ala Glu Lys Asp Gly Val
177 625 630 635 640
178 Leu Thr Leu Val Ser Leu Pro Ser Ala Ile Pro Asn Gly Ala Val Ile
179 645 650 655
180 Lys
181
182
183